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Abstract 

In a wireless system, by pooling distributed users to form a virtual multi-antenna system for joint 
transmission, the overall quality of the communication can be greatly improved. To reduce excessive 
overhead associated with large-scale user cooperation, the number of participating users must be 
well controlled. In this work, we consider the problem of properly selecting a subset of users (from 
potentially a large number of candidates) to form the virtual multi-antenna system, while at the same 
time designing their joint transmission strategies. In the aim of designing practical algorithms with 
provable theoretical performance, we focus on a class of simple yet important scenarios in which 
either multiple transmitters cooperatively transmit to a receiver, or a single transmitter transmits to 
the receiver with the help of a set of cooperative relays. We formulate the joint problems in different 
settings as cardinality constrained programs that contain both discrete and continuous variables. We 
then leverage the technique of semi-definite relaxation to obtain approximated solutions for them. The 
effectiveness of the proposed algorithms is evaluated via both theoretical analysis as well as extensive 
numerical experiments. We expect that our approach can be applied to solve cross-layer resource 
allocation problems in many other wireless communication systems as well. 

Index Terms 

Virtual Multi-antenna Systems, Beamforming, User Grouping, Cardinality Constrained Quadratic 
Program, Semi-definite Relaxation, Approximation Bounds 

I. Introduction 

With the proliferation of multimedia rich services as well as smart mobile devices, the 
demand for wireless data has been increasing explosively in recent years. To accommodate 
the growing demand for wireless data, the efficiency of spectrum utilization for future wireless 
systems needs to be significantly increased. Enabling cooperation among the nodes in a wireless 
system is regarded as an effective means for such purpose. 

In a cellular network, cooperation can be achieved by allowing the neighboring base stations 
(BSs) to form a virtual multi-antenna system for cooperative transmission and reception. This 
scheme is referred to as the cooperative multipoint (ComP) scheme [1]. It can effectively cancel 
the inter-BS interference, and has been included into the next- generation wireless standards such 
as LTE-A (see e.g., Q-lTJl). For example, in a downlink network, assuming the users' data 
signals are known at all BSs, then either the capacity achieving non-linear dirty-paper coding 
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(DPC) (see, e.g., 0, 0), or simpler linear precoding schemes such as zero-forcing (ZF) (see, 
e-g-> 0-0) can be used for joint transmission. On the other hand, various cooperation schemes 
that exploit the spatial diversity have been proposed to allow cooperation among the mobile 
users as well ll9l- lfT5l . In these schemes, users assist each other in relaying information to the 
desired destinations, by employing different strategies such as amplify-and-forward (AF) and 
decode-and-forward (DF). When using the AF strategy, the system performance can be further 
improved if the relays jointly transmit using distributed space-time coding 0, ffUL EH- 

However, the cost of cooperation outweighs its benefit when the size of the cooperation group 
grows large. Such costs could be in the form of overhead incurred by exchanging control and 
data signals among the cooperative users (either via backhaul networks or air interfaces); it 
can also arise due to the difficulty in maintaining system level synchronization 0, 0, ifPTll . 
As a result, a pressing challenge in the design of cooperation protocols is to properly control 
the size of the cooperative groups while maintaining high system performance. 

To address this issue, various partial cooperation scheme have been developed recently. 
In the setting of cellular network, partial cooperation among the BSs amounts to judiciously 
clustering the BSs into (possibly overlapping) small cooperation groups, within which they 
cooperatively transmit to the users. In references 0, |[T8l . |fT9l , cooperation clusters are formed 
either greedily or by an exhaustive search procedures. In EOl . ([2T|. a joint BS clustering 
and beamforming problem has been formulated as a sparse beamformer design problem, in 
which the sparsity of the virtual beamformer corresponds to the size of the cooperation groups. 
Partial cooperation in the relay networks has also been studied recently. In |fl4ll . fTTSll . the 
authors proposed to select a single relay (out of many candidates) so that certain performance 
metric at the receiver is optimized. Alternatively, references 031, [|22 | -[[24 | study the multiple 
relay selection problem. In particular, the authors of ll22ll propose to increase the number of 
relays until adding an additional one decreases the received SNR. Reference [|24| formulates 
the relay selection problem as a Knapsack problem E51 . and proposes greedy algorithms for 
it. However, these schemes generally assume simplified underlying cooperation schemes after 
fixing the cooperative set. For example, references 0, lfT9l use simple zero forcing strategies 
for intra-cluster transmission; references [|22l|. Il23l assume that the cooperative relays transmit 
with full power. Moreover, there is little performance analysis for these partial cooperation 
schemes. This is due to the mixed-integer nature of the problem when including the group 
membership (which is a discrete variable) as optimization variables. 

In this paper, we study the problem of optimally partitioning the users into cooperation 
groups, while at the same time designing their cooperation strategies. We focus on a class of 
simple yet important scenarios in which either multiple transmitters cooperatively transmit to 
a receiver, or a single transmitter transmits to the receiver with the help of a set of cooperative 
relays. The objective is to optimize the system performance measured by the receive signal 
to noise ratio (SNR), while explicitly constraining on the size of the cooperation group. We 
formulate the problems in different settings to cardinality constrained programs, and show 
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that these problems are NP-hard in general. We further develop efficient algorithms using 
the technique called semi-definite relaxation (SDR) ll26l . Il27l . and analyze the quality of the 
proposed algorithms in terms of their gap to the global optimal solutions. The main novelty of 
this work lies in introducing the framework of SDR for the design of efficient partial cooperation 
schemes. Moreover, our work represents the first attempt to generalize the SDR approach (and 
its associated approximation bounds) to the mixed-integer cardinality constrained programs. 

The rest of the paper is organized as follows. In Section HH we discuss the system model. 
Section ITTll-Section IVl consider the joint user grouping and virtual beamforming problems in 
various settings. Section [VI] presents numerical results. The concluding remarks and future 
works are provided in Section IVIIl 

Notations: For a symmetric matrix X, X y signifies that X is positive semi-definite. We 
use Tr[X], X^, and Aj(X) to denote the trace, hermitian, and the 2-th largest eigenvalue of a 
matrix, respectively. Let diag(X) denote a matrix consisting the diagonal value of X. Let S 
be an index set, we use X[i, j] and X[«S] to denote the (i, j ) -th element of X and the principal 
submatrix of X indexed by the set S, respectively. We use x[i] and x[«S] to denote the i-th 
element of the vector x and the subvector of x with the elements in set S, respectively. For a 
complex scalar x, its complex conjugate is denoted by x. Let 5ij denote the Kronecker function, 
which takes the value 1 if i — j, and otherwise. I n denotes an n x n identity matrix. We 
use M. NxM and C NxM to denote the set of real and complex N x M matrices. We use to 
denote the i-th unit vector and use e to denote the all 1 vector. 

II. System Model 

Suppose there are a set M. — {1, • ■ • , M} of transmitters each equipped with a single an- 
tenna, and there is a single receiver with N > 1 receive antennas. Let h i>n E C denote the chan- 
nel between transmitter i and the n-th antenna of the receiver, and let h n = [hi tTl , • • • , liM,n} H ■ 
Suppose only second order statistics on the channels are available, that is, we only know 
E[h n hf ] = Rn >- 0, for n = 1, - ■ ■ , N. Let z ~ CJ\f(0, 1) denote the (normalized) noise at the 
receiver. We restrict ourselves to the particular form of cooperation in which the cooperative 
nodes can form a linear virtual beamformer for joint transmission. Let Wi E C denote the 
complex antenna gain for user i, which satisfies an individual transmit power constraint: 
\wi\ 2 < P. Define w = [w\, ■ ■ ■ , w M ] H ■ 

When all the transmitters participate simultaneously in the beamforming, they transmit the 
same data signal to the receiver by using distinct antenna gains. When the system is viewed 
as a downlink network, such data sharing can be done via low-latency backhaul networks that 
connects the BSs. Alternatively, in the uplink, the users can share their data signals by the 
following steps: 1) identify the user whose data is to be transmitted; 2) the identified user 
broadcasts its data to all nearby users, who subsequently decode the data. For now we assume 
that when the system is viewed as an uplink network, the hop connecting the source and the 
cooperative users is reliable, in the sense that all the cooperative users can perfectly decode the 
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signals in step 2). This assumption is reasonable as the cooperative users are usually located 
in close proximity (e.g., within the same office room). Consequently the bottleneck of the 
communication usually lies in the hop connecting the users to the BS. Later in Section |Vj we 
will consider the virtual beamforming problem in a relay network, which can incorporate the 
quality of the first hop communication into the formulation as well. 

To simplify presentation, below we focus on an uplink case in which the users transmit to a 
BS. It should be understood, though, that our discussion is equally applicable to the downlink 
case (i.e., the ComP scheme) when the role of transmitters and receivers are reversed. 

Assume that the BS performs spatially matched filtering/maximum ratio combining (which 
is equivalent to the MMSE receiver in this case), then the total received signal power can be 
expressed as: J2n=i |h^w| 2 = 2~2n=i h n h^w. Let R = J2n=i ^[R-n]- The averaged signal 
to noise ratio (SNR) at the BS is then given by: SNR = E 



N l __.H h 



EiV 
71 = 1 



w ff Rw. To optimize 



the averaged SNR at the BS, one can solve the following quadratic program (QP) 



max w ff Rw 



(1) 



s.t. 



< P, i = 1, • 



,M. 



As will be seen later in Section IIII-Bl this seemingly simple problem turns out to be compu- 
tationally very difficult for general channel covariance matrix R. 

It is worth mentioning that when the instantaneous channel states {h n } are available, the sum 
SNR maximization problem is of the same form as in ([I]), with R replaced by R = 2~2n=i hnh^. 
All our subsequent results carry over to this case as well. The reason that we interpret £0 as 
optimizing the averaged system performance is towfold: 1) it may be difficult to obtain {h n } 
exactly when the number of nodes is large; 2) using instantaneous channel states for user 
grouping inevitably results in frequent change of group membership, which is usually costly. 




Base Station/Receiver 



Base Station 




Fig. 1. Illustration of joint user grouping and virtual Fig. 2. Illustration of joint relay selection and virtual 
beamforming. Users 1-3 and users 4, 5 are divided into beamforming. Relays 1-3 are chosen as the serving 
two different groups. relays. 

In practice, when the number of users becomes large, allowing all of them to cooperate at 

the same time induces heavy signaling overhead (related to users' exchange of data and control 
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signals) and computational efforts (related to computing the optimal virtual beamformer for all 
the users) ifTTl . To address these issues, we propose to dividing the users into different coop- 
erative groups while at the same time optimizing their virtual beamformers. We first consider 
a relatively simple case in which the aim is to find a single cooperative group with a fixed 
size. Such admission control problem is important as fixing the size of group can effectively 
control the cooperation and computational overhead. We then consider two more general cases 
in which either the users need to be allocated into groups that are served in orthogonal time 
slots, or there is an additional hop of transmission to be taken into consideration. 

III. Joint Admission Control and Virtual Beamforming 
A. System Model 

Let Q denote the desired size of the cooperation group, and introduce the set of binary 
variables a, E {0, l},i E M to indicate the users' group membership: when a, = 1, user i is 
being assigned to the cooperation group. Let a = [a±, - ■ ■ , ajw]. Then the joint user grouping 
and beamforming problem is given as the following cardinality constrained program 

vf p = max w fl Rw (CP1) 

w.a 

s.t. |w t | 2 < ajP, i = !,-■■ ,M 

M 

Y^a.i = Q, di G {0, 1}, i = 1, ■ • • , M. 

Note that cij = implies \wi\ 2 = 0, that is, user i does not transmit. In the following, we will 
use ff p (w) to indicate the objective value achieved by a feasible solution w. 

Introducing a homogenizing variable 7 E { — 1,1} and change the domain of the discrete 
variables to { — 1, 1}, the problem (|CP1I) can be equivalently reformulated as 

max w ff Rw 

w,a,7 

p 

s.t. w H e t ef w + — {di - 7) 2 < P, i = 1, • ■ ■ , M 

M 

^(a 4 + 7 ) 2 =4Q, 
i=i 

a, £{-1,1}, i = !,■■■ ,M, 7 G{-1,1}. 
After such transformation, we see that 7 x = 1 implies W; L ^ 0, i.e., user i joins the cooperative 
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group. To further simplify the problem, we introduce the following set of definitions 

C 4 ,o = ^e., + -e M+1 e M+1 - -e,e A/+1 - -e M+1 e t , C ia = e 4 e t — (2a) 

D !: 4 blkdg[C ij0 , Ci,i] e C ( 2M+1 )x( 2M+1 ) (2b) 

R ^ blkdg[0, R] e C (2A/+1 > x ( 2A/+1 ) (2c) 



B ^ 



€ K (M+l)x(M+l) 5 B = blkdg [ Bo ,0] 6 M (2M+l)x(2M+l) (2d) 



I e 

e T M 

x = [a , 7, w J xq = [a , 7J , xi = w. (2e) 



Then the problem (ICPlb can be further simplified as 

max x ff Rx (Rl) 

X 

s.t. x H DiX < 1, i = !■■ ■ ,M (3a) 
x H Bx = 4Q, (3b) 
e{-l,l}, i = l,--- ,M + 1. 

We emphasize again that in this new notation, x[i]x[M + 1] = 1 implies x[M + 1 + i] ^ 0. 

5. r/ze Complexity of the Problem (ICP1I) 

In fact, solving problem ( ICP1I) (or equivalently problem (IR1I) ) is challenging. In this sub- 
section, we show that this problem is strongly NP-hard. To this end, it is sufficient to show 
that even when fixing the values of the binary variables {a;}, the problem is still NP-hard. 

Let S C M. with \S\ — Q denote the support of a feasible solution w to problem (ICP1I) : 
S = {i : a,i = 1}. When S is fixed, problem (ICP1I) is equivalent to the following QP 

max w ff R[5]w (QP1) 

weCQ 

s.t. \wi\ 2 <P,i = l,--- ,Q. 
In the following we analyzes the computational complexity for ( |QP1[ ). 

Proposition 1 Solving the problem ( |QP1| ) is NP-hard in the number of users. 

Proof: We only prove the claim with real variables. The complex case can be derived 
similarly. The claim is proved by a polynomial time reduction from a known NP-complete 
problem called equal partition problem If25l , which can be described as follows. Given a vector 
c consisting of positive integers ci, • • • , cq, decide whether there exists a subset X such that 

5l> = E«- (4) 

i=l iel 

Suppose c T c = C. Let R[S] = (-cc T + 2CI Q ) y 0. The claim is that the problem ( |QP1| ) 
has the maximum value of 2CQP if and only if there exists a set X satisfying ©. With this 
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specific structure of R[«S], the objective of problem ( |QP1| ) can be written as 

Q Q 

w T R[5]w = -|w T c| 2 +2C^] \w t \ 2 < 2CJ2\^\ 2 ^ 2CQP, whenever \w t \ 2 < P. (5) 

i=i i=i 

Consequently, the maximum value for problem ( |QP1[ ) is 2CQP iff — |w T c| 2 = and \wi\ 2 = P. 
This is equivalent to the existence of an index set X such that © is true. ■ 
We note that the difficulty in solving problem ( ]QP1| ) can be mainly attributed to the set of 
individual power constraints. If we only have a total power constraint in the form of ||w|| < P, 
then this problem is simply the principal component analysis (PCA) problem. In this case, the 
original problem (ICP1I) becomes one that finds a Q x Q principal submatrix of R that yields 
the maximum eigenvalue (or equivalently, the sparse PCA problem, see ll28l ). This problem is 
still difficult to solve, but there are efficient heuristic algorithms for it, see ||28l , ||29l . 

Remark 1 The result obtained in Proposition Q] asserts that ( |QP1| ) is difficult to solve for 
general R. However when R admits certain special structures, both ( |QP1| ) and (ICP1I) may be 
easy to solve. One such example is that when R is a rank 1 matrix, i.e., R = rr H for some 
vector r £ C M . This channel model may be of interests as it corresponds to the special case 
in which the receiver has a single antenna, and the instantaneous SNR is considered. Another 
relevant special case is that when R is a diagonal matrix, which happens when all the users' 
channels are independent. For both cases, problems ( |QP1| ) and (ICP1I) are separable among the 
variables, and their solutions can be easily obtained in closed form. 

C. The Semi-definite Relaxation 

Our proposed algorithm is based on the technique called semi-definite relaxation (SDR), 
which has been widely used to solve problems in communications and signal processing ll26ll . 
We emphasize that unlike conventional SDR methods (such as those surveyed in 112610 . in which 
the problems to be relaxed have either all continuous (e.g., G710 or all discrete (e.g., POlD 
variables, our problem (ICP1I) (or equivalently dED)) is of mixed integer nature. Consequently 
our algorithm and analysis differ significantly from those developed in the literature. 

We first introduce two semi-definite programs. Define a variable X = xx H . Define two index 
sets X = {1, • • • , M + 1} and X = {M + 2, • • • , 2M + 1}. Then X = X[X] and X x = X[X] 
denote the leading and trailing principal submatrices of X, respectively. Clearly X = x Xq 
and Xi = xixf\ Moreover, we have Rank(X) = 1 and X [i, j] E { — 1, 1} for all i,j £ X. The 
following semi-definite program is a relaxation of (IR1I) . by removing the non-convex constraint 
Rank(X) = 1 and by replacing X [z, j] £ {-1, 1} by X [z, j] £ [—1,1], for all i,j £ X: 

«f DP = max TrfRXl (SDP1) 

s.t. Tr[DiX] < 1, « = !••• ,M (6a) 
Tr[BX] = AQ (6b) 
X[i,i] = l,i = !,••• ,M + 1. (6c) 
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Note that we did not explicitly include the conditions X [z, j] £ [—1, 1], for all i, j £ X, as it 
can be ensured by the set of conditions (l6cl) and X ^ 0. As the above problem is a relaxation 
of problem (IR1I) . we must have t>^ DP > t>^ p . Denote the optimal solution for this problem as 
X*. Without loss of generality we can assume X* = blkdg[Xo, X*]. 

Similarly, let Y = ww H £ C QxQ . The following problem is a relaxation of the problem 
( |QP1[ ), for a given index set S C M with \S\ = Q 

max Tr[R[51Yl (7a) 

Y^O 

s.t. Y[i,i] <P,i = !,■■■ ,Q. (7b) 

Let us denote the optimal solution of this problem by Y*. 
The following remarks summarize some useful properties of the these two problems. 

Remark 2 At optimality, the set of constraints (l6al) and dTbl) must be all tight. 

We show the tightness for (l6al) . The case for (TTbl) can be argued similarly. Suppose on the 
contrary, there exists i £ M. such that Tr[DjX*] < 1. Then from the definition of Dj in (l2b~l) . 
we have: TrfC^iX*] + Tr[C i|0 Xo] < 1, which is equivalent to: pXJM < | + ±X5[i,M + l]. Due 
to the assumption that the inequality is strict, we can find a constant 5 > such that 

ix;[*,i] + *=i + ix5[i,M + i]. 

As a result, let X : = X* + c^ef ^ 0, then X = blkdg[X*, XJ is also feasible for (ISDP1I) . 
This alternative solution achieves the following objective 

vf up (X) = Tr[R(X* + Saef)] = Tr[RX*] + <5Tr[Re ie f ] > Tr[RXJ], (8) 

where in (a) we have used that fact that 5 > and R[i, i] > (due to the strict positive 
definiteness of R). Clearly, the inequality ([8]) is a contradiction to the optimality of X*. 
In conclusion, we have 

^X* 1 [i,i] = ^ + ^X* [i,M + l], V * = !,••• ,M. (9) 



Remark 3 The sum of the last column of Xq admits a closed form expression. Note that 
the cardinality constraint ([60) implies that 2 Y^=i x o[^ M + 1] + 2M = 4Q, which is further 
equivalent to: J2tLi x o[^ M + 1] = 2Q - 71/. 

D. r/ze Proposed Algorithm 

In this section, we propose a randomized algorithm that generates an approximated solution 
to the problem (ICP1I) . To highlight, we first list the main steps of the algorithm: 

1) Compute the optimal solution X* of the relaxation problem (ISDP1I) : 

2) Determine the discrete variables x and the set S according to X^; 
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3) Fixing S, compute the optimal solution Y* of problem dTal) ; 

4) Randomly generating a sample of feasible w's using Y*; 

5) Select the solution that achieves the best objective value for problem (ICP1I) . 

The distinctive feature of the proposed algorithm, compared to many SDR-based algorithm, 
is mainly in steps 2)-4), in which the solution X* is converted to a vector w with prescribed 
number of zero components. It is worth noting that the proposed algorithm generates an 
approximated solution with guaranteed quality (see Section UlI-EI for detail). 

To formally describe the proposed algorithm, the following definitions are needed. Let S C 
M. be an index set. Decompose Y* by Y* = A H A. Then define 

n(5)^^e,efel MxM 

E,- = ACj ! i[5]A if ! E = AR[<S] A ff . 

Note that for an arbitrary MxM matrix, multiplying Q(S) from both right and left leads to 
a matrix that has nonzero elements only in rows and columns indexed by S. Let us further 
decompose E by: E = USU H . Then the diagonal matrix S can be expressed as 

£ = U H EU = U ff AR[S]A H U. (10) 

Define % = U^E^U. 

Let us use L to denote the sample size of the randomization procedure, and use (I) to denote 
the index of a random sample. Let X(q) and respectively denote the Q-th largest value in 
the sets {X$, M + 1]}^ and {R^z']}^. The proposed algorithm is listed in Table H 

TABLE I 

The Proposed Algorithm for Admission Control 



S 1 : Compute the solution X* of problem ( ISDP1I ) 
S2: Find a set T of indices such that |T| = Q and 
T={j :X* \j,M + l] >x (Q) }; 

S2a: If Tr[Rfi(T)Xjn(T)j > Tr[R], let S = T; 

S2b: Else Let S = { j : R[i, i] > r (Q) }; 

Let S = M \ S; 
S3: Set x [M + 1] = 1 and x [j] = 1, for all j e S; 

Set x [i] = -1 for all i e S; 
S4: Compute the solution Y* of problem ( TTab with index set S; 
For t= l,--- ,L 

S5: Generate £ { — 1, 1}® by randomly and independently 
generating its components uniformly from { — 1,1}; 

S6: Compute <W = yJmaxi €S (£ w ) T Ei£ w ; 
S7: Compute = ^rA ff U| ( ^; 

Let wW[5] = wM and w^[S] = 0; 
End For 

S8: Compute £* = arg £ max(wW) i/ Rw' f '; let w* = w( r '; 
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This algorithm can be viewed as a generalization of the algorithm developed by Nemirovski 
et al. and Zhang et al. OTTl . 11321 for approximating continuous quadratic programs. The novelty 
of this algorithm mainly lies in Step S2)-Step S3), in which discrete variables are determined. 
Below we motivate Step S2). Note that without rank relaxation, X = x Xq is the block 
variable representing the discrete variables, so it is reasonable to select cooperative groups 
using the elements of this matrix. Furthermore, recall that in problem (|RH) . a user % joins the 
cooperative group if x [i]x [M + 1] = 1, which, combined with the definition X = x Xq , 
implies that X [i,M + 1] = 1. Ideally, we should form the cooperative group by choosing Q 
elements in the set S = {i : Xjjfi, M + 1] = 1, i G Ai}. However it is possible that |<S| < Q, 
as we have relaxed the non-convex condition X [i,j] G { — 1,1} to X [i,j] G [—1,1]. As a 
result, we instead choose the largest Q elements in the set {X.*[i,M + 1]}^ X . Step 2a)-2b) 
are some technical refinement of the selection procedure that are needed later for the proof of 
the approximation bounds. We note here that in our numerical experiments, we find that the 
cooperative set is almost always determined by the set T, and Step 2b) rarely gets executed. 
The reason for using Step S4)-S8) to generate a feasible solution w* can be explained by the 
following two remarks. 

Remark 4 The objective value of the problem (ICP1I) evaluated at a solution is given by 

{twy 

- qvt^w - w Tr ^ - w Tr[R[5]Y1 

where (a) is from (|26l ); (b) is from the fact that S is diagonal, and the diagonal elements of 
£ W (£ W ) T are all 1; (c) is from the definition of S. 

Remark 5 We claim that for all I = 1, ■ • • , L, the solution = [x^, (w^) H ] H is a feasible 
solution for the problem (IR1I) . Moreover, is a feasible solution to (1CP1I) . 

First we show that the constraint (x^') H D.jX^^ < 1 is always satisfied for alH G Ai. To see 
this, we consider the following two cases (we omit the superscript (£) for notational simplicity). 

Case i): If i G S, then we have 

x ff D 2 x = (w) H C M w + (x ) H C ii0 x 

= (ft(S)w) H C M ft(S)w + (x ) H C\ x 
= (w[5]) i? C i ,i[5]w[5] + (x ) ff C 4 ,ox 

= ^ T U ff AC M [5]A ff U| + i(l - xo[*]xo[M + 1]) 

(a) 1 T ~ 

where (a) is from the definition of Ej and from the fact that x [M + 1] = 1 and for all i G S, 
x [i] = 1; (b) is from the definition of t in Step S6) of the algorithm. 
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Case ii): If i e S, then from Step S7) of the algorithm, we have Wi = 0. Using the fact 
that C^i = e^ef we have: x H D t x = (wJ^C^iw = |w'd 2 -p = 0. It is straightforward to see that 
the cardinality constraint x^Bx = AQ is satisfied for each solution w^, as this constraint 
mandates that M — Q users do not transmit. 

In summary, we conclude that x is feasible for the problem (IR1I) . Due to the equivalence 
between problems (IR1I) and (ICP1I) , w is also feasible for the latter problem. 



E. The Analysis of the Quality of the Solution 

We are now ready to analyze the quality of the solution generated by our proposed algorithm. 
Naturally, such quality should be measured by comparing the achieved objective to the globally 
optimal objective v^ p . More specifically, we would like to find a constant a± > 1 such that: 
uf p (w*) > ^-uf p . The constant ol\ is referred to as the approximation ratio of the solution w*. 
The smaller the value of ol\, the better the quality of the solution w*. However, due to the 
NP-hardness of solving problem (IR1|) . p is generally difficult to compute. Alternatively, in 
the following we will find a constant oti that satisfieswp p (w*) > ^-fi DP - Note that the latter 
inequality implies the former, as vf DP > v^ p . We have the following results regarding to the 
quality of solution w*. 

Theorem 1 If w* is generated using the algorithm in Table then with high probability, we 
have up p (w*) > ^vf DP , with a± bounded by 

8MAi(R) 
EfliAi(R) 

Proof: To show Theorem [TJ below we first show that for any £ = 1, • • • , L, there exists a 
constant a\ satisfying 1 < a\ < oo such that: Prob (vf p (w^) > ^-Vi DP ) > 8 > 0, or equivalently, 

Prob [ — ]— Tr[R[5]Y*] > — Tr[RXJ] ] > 8 > 0. (13) 



a i ^ r ln (5Q) < 8Mln(5Q). (12) 



(tW) 2 L L J J " at 

That is, with positive probability, the solution generated by the proposed algorithm is at 
least as good as ^- fraction of v SDP . It follows that the probability that the solution w* achieves 
an objective that is at least ^ of v f DP is given by 

Prob f maxt)P(w (£) ) > — v? p 
\ * on 

> Prob fmaxvHwW) > — vf>A = l - Pro b Lax W f p (w( £ )) < — vf BP ) 
V 1 a x J \ i ax J 

= 1 - JJProb ^ CP (wW) < ^^ DP ) > 1 - (1 - 8) L . 

Clearly, this probability approaches 1 exponentially as L becomes large. 

Below we will show (TT3T ). The analysis is divided into the following three steps. 
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Step 1): Let (3 > be a constant. We first show that when ^- is small enough, with zero 
probability the event Tr[R[S]Y*] < ^Tr[RX^] happens. 

To this end, we first lower bound Tr[R[«S]Y*]. We have the following two cases. 
Case i) Suppose S = T = { j : X{[j, M + 1] > x}, then we have 



„SDP 



(i) 



(ii) 



Tr[R[5]Y*] > Tr[R[5]XJ[5]] = Tr[n(S)Rn(S)n(S)X;|;n(S)] W Tr[RfJ(S)X*fl(S) 



where (i) is from that fact X*[«S] is a feasible solution to the problem (TTal) . and that Y* is the 
optimal solution for that problem; (ii) is because f2(<S)f2(<S) = Q(S). From Step S2) of the 
algorithm, we must have Tr[Rft(«S)XJft(«S)] > Tr[R], it follows that 

OP 

Tr[R[S]Y*] > ^-Tr[R]. (14) 

Case ii) Suppose S = {j : R[i,i] > r}. Then we have 

(0 (") QP 

Tr [R[5]Y*] > P x Tr[R[«S]I Q ] > ^Tr[R] (15) 

where (i) is again from that fact PIq is a feasible solution to the problem (TTal) . and that Y* is 
the optimal solution for that problem; (ii) is from the fact that each i E S is among the largest 

Q elements in the set {R[M]}^=i> which leads to Y^ies z 1 — mE^i^[M] = ^Tr[R]. 

We then upper bound Tr[RX*]. By a trace inequality for the product of two semi-definite 
matrices, we have that ll33l 

M M 

Tr[RXt] < J2 A * ( X *i) A *( R ) < Ai(R)^ A, (XJ) = A x (R)Tr[X;]. (16) 



Utilizing this result, we have 



M 



Tr[RXJ] < Ai(R)Tr[XJ] ^ PAi(R)£ ( 5 + ^oih^+l 



= PAi(R) ( 


v T ~ 


1 M 

i=l 


( = ] PAi(R) 


/2Q 


- M 


V 2 



QPAi(R) (17) 



where (i) is from the tightness of the first set of constraints of the problem (ISDP1I) (cf. ©) 
and (ii) is due to Remark |2] and Remark [3] Comparing (fT5l) and (TTVl) . we see that choosing 



^ maTP) ensures 

Prob f Tr[R[S]Y*] < -^-Tr| 
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Step 2): For fixed a±, we bound the probability that Prob f ^jyp < as follows (omitting 
(£) for simplicity) 



Prob [ — < -J = Prob(t 2 > /3) = Prob ^max(|) Ej£ > P 
< ^Prob ((^) T E^ > /?) < 4Q/iexp f-| 

where /i = min[Q, max j Rank (Ej)]. The last inequality is obtained by slightly generalizing 

]0. 



the existing result in Oil Proposition 1] Lj. To explicitly compute the value for \i, note that by 
definition, we have 

Ej = U H E,U = U fl ACu[5]A fl U. 

As a result, Rank(Ej) < 1 as by definition Rank(Cj j i) = 1, and any one of its principal 
submatrix must have rank at most 1. We conclude that < ji < 1. 
Step 3): Utilizing the above result, choose 

/3 = 81n(5Q), ai = &M T ^ ] ln(5Q), 

we can bound the left hand side of ([131) as follows 

Prob ( 4Tr[R[S]Y*l > — Tr[RXf 

> Prob ( Tr[R[S]Y*] > — Tr[RXj], \ > \ 

\ Oil t z p 



> 1 - Prob ( Tr[R[5]Y*] < — Tr[RX^] ) - Prob ( — < 



|T,[RX;])-Pr„b(i<i 



( 1 < >l-4Qexp(-ln(5Q)) =1-4Q/(5Q) = ^. 



In conclusion, the final approximation ratio is given by 

^ = H5Q) = ln(5Q) " 8Mln(5Q) - 

This completes the proof. ■ 
We see that if the eigenvalues of the channel covariance matrices R are roughly uniformly 
distributed, then the derived bound is of the order 0{ln(Q)). On the other hand if R has a 
single dominant eigenvalue, then the bound is of the order 0(M\n(Q) + M). Nevertheless, 
it is important to note that the theoretical approximation ratio obtained above characterizes 
the quality of the worst solutions. As we will see later in our numerical results, in practice 
the proposed algorithm seems to generate solutions that have a much better quality than the 
derived worse-case bound. 



'The cited result can be generalized from the real case to the complex case using the complex form of the large deviation 
results from Bernstein, see e.g., 1341 Theorem 4], and the recent results by Zhang and So 1321 . 
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IV. Joint User Scheduling and Virtual Beamforming 
A. Problem Formulation 

In this section, we consider the case where there are two orthogonal time slots available for 
transmission. The problem is to effectively allocate Q users to the first slot and the rest M — Q 
users to the second one, so that the minimum SNR among these two time slots is maximized. 
Such min-SNR maximization problem aims at providing fairness to both groups of the users. 

Let w k E C M denote the virtual beamformer used in the k-th time slot, and let Wk ,j G C 
denote user i's antenna gain in slot k. Mathematically, the problem is given by 

u? p = max min wf R(w fc ) (CP2) 

{wi,w 2 ,a}fci,2 

s.t. \wi4 2 <OiP, |w 2 ,*| 2 < (l-Oi)P, i=l,---,M 

M 

Y^a,i = Q, en e {0,1}, i = 1,- •• ,M. 

?:=i 

In the following, we will use -U2 P (w l7 w 2 ) to denote the objective achieved by a feasible tuple 
(wi,w 2 ). Let B , C i)0 and C^i defined the same as in (l2al)— (l2eT). Further define 

Ai,i ^blkdg[C i , ,C i ,i,0], A i>2 4blkdg[C il0 ,0,C i>1 ] € C (3M+1 ) X ( 3A/+1) 

B ^ blkdg[B , 0, 0] G E (3M+1)X(3M+1) 

A r T /) T TlT A r T «1 A A 

x = [a ,£,w 1 ,w 2 J , x = [a ,£J, Xi = Wi, x 2 = w 2 . 

Then problem (ICP2I) can be equivalently written as 

max min x^Rxj, (R2) 

s.t. x^A^ix < 1, i = 1- •• ,M 
x^A ii2 x < 1, i = 1 • • • , M 
x fl Bx = 4Q, x[i] G {-1, 1}, i = 1, • • • , M + 1. 



B. Complexity Status 

Clearly this max-min scheduling problem is at least as difficult as its admission control 
counterpart, as when fixing the group membership, the subproblem of maximizing per-group 
SNR is the same as (JQPTJ). To see this, we again fix an index set S C M. with |«S| = Q, and 
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let S = Ai \ S. Then the problem (ICP2I) reduces to the following two QPs, one for each slot 

max wfR[5]wi (18) 

Wi 

s.t. | Wl [z]| 2 <P, i = l r .-,Q 
wi e C Q 

max wfR[5]w 2 (19) 

w 2 

s.t. |w 2 W| 2 <P, i = l,-,M-Q 
w 2 g C M "«. 

Clearly these problems have the same form as problem < |QP1[ ). Interestingly, unlike the admission 
control problem, the scheduling problem is difficult even when R is diagonal or is of rank 1 . 

Proposition 2 Solving problem \CP2\ is strongly NP-hard in general. 

Proof: Below we show the case when R is diagonal. The claim is proved by using a 
polynomial time reduction from the equal partitioning with equal cardinality problem. Let M 
be a even number and set Q = y. Given a vector c G M. M consists of positive elements 
ci, ■ ■ • , cm, let C = J2iLi c i> the equal partitioning with equal cardinality problem finds an 
index set X with |X| = 4f such that ^ = J2iei c i- 

Let R = diag(c). We claim that determining if problem (ICP2I) can achieve an optimal value 
of is NP-hard. Let X denote the set such that X = {i : a* = 1} with |X| = 4f. Then the 
objective of (ICP2I) can be written as 

(' 1^2 j | 2 / < niin < P Cj, P ( 
j<£X J [ iGX jfz 

where the inequality is achieved by setting \wi yi \ 2 = P, \w 2j i\ 2 = 0, V i E X, \wi ti \ 2 = 
0, \u)2,i\ 2 = P, V i ^ I. Clearly, checking if we can find an index set X so that this problem 
can achieve an objective value of ^f- is equivalent to finding a subset X with |X| = 4^ satisfying 
y = ^ igX Cj, which is exactly the equal partitioning with equal cardinality problem. 
The case when R is of rank 1 can be shown similarly. ■ 

C. The Proposed Algorithm 

Define X e E( M+1 ) X ( M+1 ), Xi,X 2 G E MxM , and let X = blkdg[X , X 1; X 2 ]. The semi- 
definite relaxation of problem (|R2|) is given by 

^ DP = max min Tr[RX fc ] (SDR2) 




v 2 

X>-0 k=l,2 



s.t. Tr[Ai,iX] < 1, i = 1 • • • , M (20a) 

Tr[A i:2 X] < 1, i = 1 ■ ■ ■ , M (20b) 

Tr[BX] = 4Q (20c) 
X[i,i] = l,i = !,••• ,M + 1. 
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Similarly, for a fixed index set S, the semi-definite relaxations of problem (fl"8T) and (fl~9l ) are 



max TrfR[51Yil (21a) 

s.t. Yi[i,i] < P,i = I,--- ,Q, (21b) 

max Tr[Rf51Y 2 l (21c) 

Y 2 ^0 

s.t. Y 2 [i,i] <P,i = !,-•• ,M-Q, (2 Id) 



The following properties are unique to the solutions for problem (ISDR2I) . 
Remark 6 Denote the events Ai(X) and *A 2 (X) by 

.Ai(X) = {all the inequalities in ( I20ab arc tight forX} 
^4.2 (X) = {all the inequalities in ( I20bl i are tight for X}. 

We claim that there must exist an optimal solution X* for problem (ICP2I) such that both events 
.4i(X*) and *4. 2 (X*) happens simultaneously. 

We first argue that for any optimal solution X* of (ICP2I) . at least one of Al(X*) and .4 2 (X*) 
is true. Assume the contrary, then there must exist at least one inequality in (|20al) and one 
inequality in (120bl) that are both inactive. Without loss of generality, assume that TrfA^iX*] < 1 
and Tr[A i 2 X*] < 1. Equivalently, we have 

^x;[7,j] < l(i + xsb,A/ + 1]), ix5[i,i] < xs[i,M + i]). 

Then there must exist two constants 8\ > and <5 2 > such that 

i(X?[j, j] + «5x) = 1(1 + Xg [j, A/ + 1]), -1(X 2 *M + S 2 ) = 1(1 - XS[i, Af + 1]). 

Then X = blkdgfXp, X* + e^ej^i, X^ + e f ef <5 2 ] is also feasible for problem (ICP2I ). Moreover, 
the objective becomes: min{Tr[R(X^ + e J ej5i)],Tr[R(X2 + etefSi)]}, which is strictly larger than 

min{Tr[RX*], TrfRX^]}. This is in contradiction to the optimality of the solution X*. 

We then argue the claim made in the remark. Suppose for an optimal solution X* and that 
Ai(X*) is true, while ^4 2 (X*) is not. Then it is easy to see that the following is true 

min{Tr[RX*] , Tr [RX^] } = Tr[RX*] . (22) 

Otherwise one can always increase the objective without violating the constraints. Then there 
exist a set of constants Si > 0, % — 1, • • • , M such that 

-l(X 2 *[i, i] + Sitae?) = 1(1 - X$[i, M + 1]) 

while still maintaining (l22l) . Thus, X = blkdgfXg, X£, X2 + YlfiLi $i e i e I ] is also an optimal 
solution to problem (ISDR2I) . The remaining case when *4. 2 (X*) is true and Ai(X*) is not can 
be argued similarly. 
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Remark 7 At optimality, the property for the sum of the last column of Xq in Remark [3] still 
holds true. Further, suppose for an optimal solution X* both events Ai(X*) and *4. 2 (X*) are 
true. Then we have the following 

M M 

£ XJ [i, i] = - £ (1 + X$[i, M + 1]) = - (M + (2Q - M)) = PQ (23) 

i=l t=l 
M M 

[*, *] = - J2 0- - M + !]) = y (m _ (2 ° ~ M)) = p(il/ ~ 0) - (24) 

i=l " i=l 

To formally describe the proposed algorithm, the following definitions are needed. Decom- 
pose XJ, Xg by X^ = Af Ai and Xg = Af A 2 , respectively. Let 5CXbe any index set, 
define the following 

Ei(5, A fc ) ^ A fc C M [S] Af , E(5, A fe ) 4 A fc R[S] Af . 
Let us decompose E(«S, A fc ) using its eigendecomposition 

E(S,A k )=\J(S,A k )'E(S,A k )V H (S,A k ). (25) 
Then the diagonal matrix S(«S, A fc ) can be expressed as 

E(S, A fe ) = U H (5, A fc )E(5, A fe )U(«S, A k ) = U H (S, A k )A k K[S] Af U(5, A fc ). (26) 

Further define %(S, A k ) 4 U"(S, A fe )E,(5, A fc )U(5, A fc ). 

The proposed algorithm is listed in Table [Til It has similar steps as the admission control 
algorithm. The main difference is that after deciding the set S, two separate SDPs need to be 
solved, one for each set S and S. The final solutions and Wj are then obtained using the 
randomized algorithm base on the solutions of these two SDPs. Note that to find a solution 
required by Step SI), we can start by any optimal solution X* of (ISDR2I) . and increase the 
diagonal elements of X^ or X2 until all the constraints in (120ab and (|20b| ) are satisfied. 

Using the argument identical to that presented in Remark |4] and Remark [5j we can verify 
that for all i, [x , wf , w^] must be feasible for problem (IR2I) . Moreover, we have that 

(wf'^Rwf = — i— Tr[R[5]Y£], (wf) fl Rwf = — i— Tr[R[5]Y|]. 

(*i ) 2 (4 t 

D. 77ze Approximation Bounds 

We have the following results regarding to the quality of the solution x*. 

Theorem 2 7/"x* generated according to the algorithm listed in Tabled then ^(w*, Wg) > 
^uf DP , wzY/z a 2 given a* 



a2 = ■ /n 8 Af Al m\ ^ 111(12 max{Q ' M - Q}) - (27) 
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TABLE II 

The Proposed Algorithm for Joint Beamforming and Scheduling 



SI: Compute the optimal solution X* of problem (|SDR2t , 

such that all the constraints in d20al > and d2Qb| ) are tight. 
S2: Find a set S of indices such that |<S| = Q and 
S = {j :X* \j,M + l] >x (Q) } 

Set S = M \ S; 
S3: Set x [M + 1] = 1 and x [j] = 1, for all j G S; 

Set x [i] = -1 for all i e S; 
S4: Compute the solution Y*, of problem dTHJ and ( fT9b , 

with index sets S and S, respectively; 
For£= I,--- ,L 

S5: Generate £^ G {-1, 1} Q and ^ G {-1, 1} M -Q by randomly and 
independently generating their components uniformly from { — 1,1}; 



S6: Compute = ^ nuix ie5 (g^) T E i (5, Ai)^ ; ; 

Compute 4 £) = v /ma Xie<s -(^ ) ) T E l (5,A 2 )^ ) ; 
S7: Compute = Af U(5, A^f 5 ; 

wW=^A?U(5,A,)^; 
Let w W[5]=wfl wf 5 [5] = 0; 

Ixtw^ = ^,^[S| = 0i 
End For 

S8: Set I* = arg £ max f min{(wf } ) ff Rwf \ (wf) ff Rwf}; 
Set wj = wf >, = wP, x* = [xg\ (w^) T , (w^) T ] T 



Proof: The proof of this theorem follows the similar steps as in Section IIII-EI Again our 
goal is to show that there exits a 5 > such that 

Prob C min {(wf) s Rwf } > A- min {Tr[RX^}) > 6 > 0. 

\ re— 1,2 Q?2 /c— 1,2 / 

Note that we have 

Prob ( min{(w[ f) ) ff Rw[ f) | > J_ min {TrfRXtl} 

\k=i,2 fc a 2 fc=i,2 

= Prob L m |_^_ Tr[R[5] Yl] > _l_Tr[R[5]Y*]| > JL mm {Tr[RX£]}J 
> Prob (min {Tr[R[<S]Y*], Tr[R[<S] Y2]} > £ min {Tr[RX£]}, ^_ > I, > I 

We first lower bound TrfRfjSjYj] as follows 

Tr[R[«S]Y*] > Tr[R[«S]X;[«S]] = Tr[ft(S)Rft(S)X;] 
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where the inequality is from the optimality of Yg as well as the feasibility of X^fiS] to the 
problem (fT9l) . The right hand side of the above inequality can be further lower bounded by 



ies 



Ti[n(s)R,n(s)x*} > Tr[n(s)x* 2 n(s)}x M (R) = ^x;[m]a m (R) 

i^)(M-Q-^,M + l] 



ies 



where (i) is because of the tightness of the constraint (|20bl) . We argue the inequality (ii) as 
follows. Due to Step S2) of the algorithm, {Xq[z, M + l]}j Gl s are the smallest M — Q elements 
in {X.* [i,M + l}}fl 1 . Combining with the fact that XJjfi, M + l] = Q, we must have that 
^2 ie $ Xq[z, M+l] < ^Tj Q, which further implies (b). Combining the above two inequalities, 
we obtain Tr[R[<S]Y*] > P{M M Q)2 \ M (R). 

Using the same argument, we can lower bound TrfR^Y] 1 ] by: Tr[R[5]Y^] > ^A M (R)- 

As a result, we have 

mm{Tr[R[<S]Y?],Tr[R[<S]Y|]} > min{Q 2 , (M - Q) 2 }^A M (R). 

Utilizing the results in Remark [Vj we further have 

Tr[RX^] < Ai(R)Tr[X^] = X 1 (R)QP 
Tr[RX^] < Ai(R)Tr[XJ] = Ai(R)P(M - Q) 

which implies miri fc= i. 2 {Tr[RX£]} < Ai(R)Pmin{(3, M - Q}. Consequently, when choosing 
P Ai (R)P min{<5, M - Q} Ai(R)M 



we have 



Q'2 min{Q 2 , (M - Q) 2 }P^Am(R) Am(R) min{Q, M - Q} 



Prob ( min{Tr[R[«S]X^],Tr[R[«S]X;]} < A min{Tr[RX£]} ) = 0. 



Going through similar steps as in Step 2)-Step 3) in the proof of Theorem [T] the final ratio is 

a * = ■ sn m nL m\ hl ( 12 max W' M - Q V- (29) 
mm{Q, M - Q}\m(R) 

This completes the proof. ■ 
Clearly for a given channel covariance R, the bound developed above achieves its minimum 
when the two slots have the same number of users, i.e., when M = 2Q. On the other hand, 
the ratio becomes worse if the the number of users allocated to the slots are increasingly 
unbalanced. We mention that it is possible to extend our approach to scenarios when multiple 
orthogonal time-slots are available. However, unlike the 2-slot case, more discrete variables 
are required (M discrete variables for each slot). Our current algorithm and its approximation 
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bound need to be carefully modified for this more complicated case. We leave this problem as 
a future research direction. 

V. Joint Relay Grouping and Virtual Beamforming 

The approaches we have developed in the previous sections are applicable to a more general 
relay network, where the problem is to group the relays to form a virtual multi-antenna system, 
and at the same time design their virtual beamformers for optimal signal relaying. 

A. System Model 

Consider a network with a pair pair of transceiver and a set of M relays, each of which has 
a single antenna (see Fig. |2] for an illustration). Assume that there is no direct link between the 
transmitter and the receiver. Let {/i}^ and {<7»}|fi denote the complex channel coefficients 
between the user and the relays, and between the relays and the receiver, respectively. 

Let us focus on a popular AF relay protocol, in which the transmitter broadcasts the desired 
signal to the relays, who subsequently forward the signals to the receiver. Assume that there 
is a large number of relays available, and any group of them can form a virtual multi-antenna 
system for transmission. For the same reason as in the user grouping problem such as overhead 
reduction and synchronization, it is of interest to select a subset of relays to joint the AF process. 

To formally present the system model, we use s G C to denote the message transmitted 
by the transmitter; use P to denote the transmit power; use z/j to denote the noise at the 
i-th relay with power cr 2 . Then the signal X{ received at the 2-th relay can be expressed as 
Xi = y/PofiS + Vi. Again use Wi to denote the complex gain applied by the i-th relay, it follows 
that the transmitted signal of i-th relay is given by yi = WiXi. Using this expression, we readily 
obtain the averaged transmit power of relay i 

E[\ yi \ 2 } = M 2 E[x^] = H 2 (P E[|/;| 2 ] +a 2 v ) . 
Let n G £A/"(0, cr 2 ) denote the noise at the receiver, then the received signal is given by 

M M M 

z = ^2 giVi + n = \f~P~Q ^2 WifigiS + ^ w l g l v l + n . (30) 

i— 1 i—1 i—1 



signal 

The averaged signal power at the receiver is then given by 



E 



M 

Mi 



i=l 



(31) 



where S = P E[(f O g)(f O g) H ], with Q denoting the componentwise product. Assuming {ui]fii 
and {gi}^ are independent from each other, the averaged noise power is given by IfTTI 



E 



M 

n 



i=l 



w H Fw + al, (32) 
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where F = E[(g Q^)(gO v) H ], Additionally, if we further assume that the noises {z^}|£i are 
independent, then F becomes diagonal: F = er 2 diag(E[gg ff ]). It follows that the averaged SNR 
at the receiver is given by [fTT]| : SNR = cr 2 V ^ w fl^ w ■ The joint relay grouping and beamforming 
problem can be formulated as 



.CP 



w H Sw 



max 



a" a'}, + w ff Fw 

2/DicTli |2i , _2 



(CP3) 



st |u> 4 | 2 (P E[|/ l | 2 ]+^) < ai P, i = l,---,M, 



^a, =Q, ai €{0,1}, i = !,-■■ ,M. 

i=X 

Once again, we have used the variables a = [ai, ■ • • , clm] G {0, 1} M to denote the binary 
grouping decisions. This problem can be compactly written as 



QP 

UcT = max 



x fl Fx 



s.t. x H T> i G l x<l, i = !,-■■ ,M, 

x ff Bx = 4Q, x[i] e {-1, 1}, i = 1, ■ • • , M + 1. 



(R3) 

(33a) 
(33b) 



where Dj, B and x are given in (I2ab-(l2eb. and the (2M + 1) x (2M + 1) matrices F, S and 
G, are defined as 





s 

Im+i 




F 



e,e 



,M. 



B. The Complexity Status 

In this subsection, we analyze the numerical complexity of joint relay grouping and virtual 
beamforming problem. The following theorem shows that solving (ICP3I) is generally NP-hard. 



Theorem 3 Solving (ICP3I) is NP-hard in general. 

Proof: It suffices to show that solving (ICP3I) is NP-hard even when the active relays are 
known. In other words, it is sufficient to show that solving the problem 

w fl Sw 



max — tj— — 

w ai + w H Fw 



(35) 



s.t. 



(p<M[\fi 



cr, 



;) < p, 



M, 
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is NP-hard. We prove by using a polynomial time reduction from the integer equal partitioning 
problem. To this end, let us consider the following system parameters: 

P = l, P = 2, a 2 v = l, /i = l,Vi. 
Then the problem (|35l) can equivalently be written as 

w H E[gg ff ]w 



w (j2 _|_ -vv-ffdiag (E[gg H ]) w 
s.t. \wi\ 2 < 1, i = !,-•• ,M. 



(36) 



Let t denote the objective value of the above problem. In order to check the achievability of a 
particular value t, we need to check the feasibility of the following set of inequalities: 

w ff (E[gg ff ] - tdiag (E[gg ff ])) w > to* 
\ Wl \ 2 < P, Vi. 



Therefore, it suffices to show the NP-hardness of checking the achievability of (1371) . To this 
end, we first claim that for given a positive definite matrix A, the following set is non-empty 

T = {(i,X) | X-idiag(X) = A,A^O,i>0,tGl,Xe r mxjv/| 
To justify this claim, consider the mapping 0a(^) : ^ + ^ M MxM , where (j>x{t) = B with 



B 



A i:j i y£ j 



l-t L 



Note that <\)pSf) is continuous over [0, 1) and 0a(O) = A. Therefore, for small enough t' > 0, 
we have </>a(0 = B' >- 0. In other words, (t', B') G T. The above claim implies that there 
exists a positive definite matrix R = E[gg^] and a positive scalar t such that 



Efgg"] - tdiag {E[gg H }) = 2C1 - cc 



T 

) 



with C = ||c|| 2 . Therefore, using the similar argument to the one in the proof of Proposition Q] 
completes the proof. ■ 
Unlike the general case, problem (ICP3I) is easy when there is no correlation between the 
channels, i.e., when the matrices S and F are both diagonal. In fact, for a fixed value of t, 
checking the feasibility problem 



> t 



M 

^a, = g, a, €{0,1} 

i=l 

is easy since we only need to pick the Q largest positive diagonal elements of S — tF and set 
the corresponding = 1. Using this observation and performing a bisection procedure on t, 
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we can obtain the optimal solution for (ICP3I) . In conclusion, (ICP3I) is polynomial time solvable 

when both S and F are diagonal. 

Similar to problem (|CP2I) . one can consider the relay scheduling problem over two time 
slots. Mathematically, this problem can be formulated as 

max min — — h —n jl — (CP4) 

{wi,w 2 ,Q!} fc=l,2 a^ + wffFWk 

s.t. \w ltl \ 2 (P E[\f % \ 2 } + a 2 v ) <a t P, \w^\ 2 (P E[\f t { 2 } + a 2 ) < (1 - Oi)P, i = l,---,M 

M 

y ^2 / a i = Q, <h G {0,1}, i = 1,- •• ,M. 
It turns out that this problem is NP-hard even for the special case of diagonal channel correlation. 

Theorem 4 For diagonal channel correlation matrices, problem (|CP4I) is NP-hard. 

Proof: Assume that fi = l,Vi It suffices to show that checking the feasibility problem 

. wfE(gg g )w fc 
mm — ± — — > t 

fe=i,2 al + w£ diag (E[gg H ]) w fe 

l^i.^l 2 (PoEH/,1 2 ] + o\) < a t P, \w 2 A 2 (PoE[|/,| 2 ] + a 2 v ) < (1 - a^P, i = l,---,M (38) 

M 

^ai = Q, a, £{0,1}, i = !,-■■ ,M, 
i=l 

is NP-hard. Let us also consider the following system parameters: 

E[| 5 ,| 2 ] = * >0, P = a 2 v = l, P = 2, Q=y, f = 
Using these parameters, the feasibility problem (1381 ) can be simplified as 



^ + EKil 2 ci " 2' 

|wi,i| 2 <ai, l^a.il 2 < (1-ai), i = l,--. ,M 
M M 

^a, = y, a< G {0,1}, i= l,--- ,M 

i=l 

Set cr^ = J2i c «/2- Using the above parameters, one can easily show that problem (l39l) is feasible 
iff the following NP-complete equal partitioning with equal cardinality problem is feasible: 
Find an index set Z C {1,2,..., M} with |Z| = 4f such that c « = Siex c c «- 
Such equivalence completes the proof. ■ 
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C. The Algorithm 

Again let us define X = xx H . The semi-definite relaxation of the reformulated problem 
(IR3I) is given by 

t| DP ^max Tr[SX i (SDP3) 
xto CT 2 + Xr[FX] 

s.t. Tr[DiGiX] < 1, i = 1, • • • , M, (39a) 
Tr[BX] = 4Q, (39b) 
X[i,i] = 1, i = I,- - • ,M + 1. (39c) 

Let X* denote the optimal solution of this problem. Clearly, we must have Tr[SX*] = ff DP (cr^+ 
Tr[FX*]). Moreover, X* must be the optimal solution of the following problem, with an optimal 
objective value t>f DP of t 

max Tr[(S - vf DP F)X] (SDP4) 
s.t. 



This problem is a relaxation of the following QP 

max x H (S-^ DP F)x 
s.t. (f33ab - d33bl. 

The similarity between problem (ISDP4I) and the relaxed user selection problem (ISDP1I) 
suggests a natural two-step approach for obtaining a feasible solution x* of (IR3I) : 

1) Solve problem (ISDP3L obtain wf DP . 

2) Obtain x* = [xq, w*] by applying the algorithm in Table U with the matrices R, Dj 
replaced by S — t>f DP F, DjGj, respectively. 

Using the same argument given in Remark [51 we can show that the resulting vector x* must 
be feasible for our relay selection problem. 

Remark 8 If we can show that x* achieves a ^- fraction of the optimal value of problem 
(ISDP4I) . then it follows that 



2,, SDP 



(w*) w (S - ^ up F)w* = (x*) w (S - v| up F)x* > — Tr[(S - ^ UP F)X*] = -=-2 — . 
Using the fact that a 3 > 1 and that (w*) H Fw* > 0, we obtain 

^ p (w*) - (x*) g Sx* > — v* DP 

which says that w* is also an a 3 -approximated solution for problem (ICP3I) . 

Unfortunately, to this point we are not able to obtain a formal approximation bound for the 
solutions so obtained. The main difficulty is that, unlike in problem (ISDP1I) . the coefficient 



25 



matrix S — ff DP F in the objective of (ISDP4I) is no longer a positive definite matrix (in fact, it is 
an indefinite matrix). Consequently, key properties of the solution for the (ISDP1I) that guarantee 
the existence of the lower/upper bounds of the objective value such as those described in Remark 
|2] no longer hold true in this case. 

Nevertheless, in our numerical experiment, we observe that the algorithm proposed in this 
section generates high quality approximated solutions for problem (ICP3I) . 

VI. Numerical Results 

In this section, we conduct numerical experiments to evaluate the proposed algorithms. For 
all simulations presented, we choose the total number of randomization as L = 200. 

A. Grouping for Admission Control and Scheduling 

We evaluate the performance of the proposed algorithm for joint admission control and 
beamforming. Let N > be a constant, which represents the number of antennas at the receiver. 
We model the channels as following the Rayleigh distribution, and generate the channel matrix 
R as follows: 1) generate the entries of a matrix H E C MxN i.i.d from a standard complex 
Gaussian distribution; 2) set R = HH H y 0. Consider a network with M = 30 users with each 
user having the same transmit power. Let the total transmission power of the users, denoted 
as P tot , set to be P to t = 10 dBW. The proposed algorithm in Section [III] (abbreviated as Al. 
1) is compared with a sparse PCA based algorithm, whose main steps are listed below: i) 
Approximately find a sparse principal component of R with Q non-zero entries (denoted as 
w), using the backward-forward algorithm proposed in ||29l ; ii) normalize w by a constant e 
so that all the individual power constraints are satisfied. In its first step, this algorithm tries to 
find the set of users that, when replacing the individual power constraints with a total power 
constraint, can provide the (approximately) best averaged receive SNR. 

Table [III] demonstrates the maximum, the minimum and the averaged ratios achieved for 
running the algorithms on 100 independent random generations of the channel. Note that the 
approximation ratio for a solution w* is calculated by a = , , . 



TABLE III 

Approximation Ratio of The Proposed and SPCA Algorithms 





Al. 1 Min 


Al. 1 Mean 


Al. 1 Max 


S-PCA Min 


S-PCA Mean 


S-PCA Max 


N=10, Q = 


1.29 


1.49 


1.79 


1.78 


2.57 


5.22 


N=20, Q = f 


1.30 


1.40 


1.54 


1.57 


2.43 


4.36 


N=30, Q = f 


1.22 


1.35 


1.62 


1.70 


2.39 


3.51 


N=10, Q=™ 


1.20 


1.35 


1.52 


1.75 


2.59 


3.58 


N=20, Q = 4f 


1.18 


1.30 


1.47 


1.51 


2.49 


4.06 


N=30, Q = 4f 


1.16 


1.28 


1.45 


1.57 


2.46 


4.05 



In Fig. [3] we plot the performance of the algorithms for different sizes of the network. For 
a given network size, we choose Q to be y and let N = M. For each network (Q, M) pair, 
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the algorithm is run for 100 independent realizations of the channel covariance matrices. We 
again plot the maximum, the minimum and the averaged approximation ratios achieved among 
those 100 realizations. In Fig. @] similar experiment is conducted with fixed number of group 
size O = 10 for all network sizes. 



Min-SPCA 
Max-SPCA 
Average-SPCA 

O Min-Alg. 1 

-0-Max-Alg. 1 
Average-Agl. 1 




30 40 50 

Number of Users (M) 



-* Min-SPCA 
-x-Max-SPCA 

Average-SPCA 
O Min-Alg. 1 
-0-Max-Alg. 1 

Average-Alg. 1 




40 50 
Number of users (M) 



Fig. 3. Approximation ratio for admission control with 
different network sizes. M G [10,20,30,40,50,60,70], 
Q = (f), Ptot = WdB, N = M, 100 independent 
realizations of channel. 



Fig. 4. Approximation ratio for admission control with dif- 
ferent network sizes. M G [20,30,40,50,60,70], Q = 10, 
P tot = 10dB, N = M, 100 independent realizations of 
channel. 



In Table [TV] we demonstrate the performance of the proposed algorithm for the max-min 
scheduling problem (abbreviated as Al. 2). Our algorithm is compared with the following two 
heuristic benchmark algorithms: 1) randomly partition the users into two groups of size Q and 
M — Q, and for each group solve a PCA problem followed by a normalization step (abbreviated 
as R-PCA); 2) randomly partition the users into two groups of size Q and M — Q, and obtain 
a solution w* and following steps S4)-S8) in Table IH (abbreviated as R-SDR). 

From the above results, it is clear that both of the proposed algorithms deliver high quality 
solutions (in terms of both the averaged and the worse case performance) to the joint admission 
control and beamforming problem. 

TABLE IV 

Approximation Ratio of the Algorithm for Scheduling 





Al. 2 Mean 


Al. 2 Max 


R-PCA Mean 


R-PCA Max 


R-SDR Mean 


R-SDR Max 


N=10, Q = f 


1.22 


1.99 


4.54 


10.06 


2.14 


4.25 


N=30, Q = f 


1.32 


2.03 


4.15 


7.08 


1.89 


3.19 


N=10, Q = f 


1.55 


2.06 


5.07 


9.89 


2.37 


4.32 


N=30, Q = 4f 


1.36 


2.09 


4.29 


9.75 


2.09 


3.35 



B. Grouping for Relay Selection in Relay Networks 

In this section, we demonstrate the performance of our algorithm for the joint relay selection 
and virtual beamforming problem (abbreviated as Al. 3). 

For the relay network, the channel covariances are generated as follows lfTTll . |[T2l . Assume 
fi can be written by fi — fi + fi, where fa is the mean of fa and f t is a zero-mean random 
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variable. Assume that /j, fj are independent for i ^ j, and choose fa = e> 6 1 / \fffjl and var(/) = 
77^/(1+77^). In this notation, 0j is a uniform random variable on the interval [0, 2ir], and 77^ 
is a parameter that determines the level of uncertainty in the channel coefficient fa. Similarly, 
let g { = gi + (ji, with ^ and defined similarly as and /j. For intuition behind this channel 
model, we refer the readers to ifTTTl . lfT2l . In our simulation below, {77^} and {rj gi } are generated 
randomly from —10 dB to 10 dB, accounting for different uncertainty levels for different users. 




Number of Users (M) 



Fig. 5. Approximation ratio with different network sizes. Fl g- 6 - Averaged received SNR for different algorithms. 
M 6 [10,20,30,40,50,60], Q = (f ), P tot = lOdBW, M 6 [10,20,30,40,50,60], Q = (4f), P tot = lOdBW, 
100 independent realizations of channel. 100 independent realizations of channel. 

Our algorithm is compared against the following three algorithms: 

• Random-GED: This is a variant of the generalized eigenvalue decomposition (GED) based 
algorithm proposed in ([TT1 Section IV-B]. In its original form, all the relays are utilized, and 
the algorithm computes an approximated solution of the max-SNR problem by performing 
a PCA for the matrix S _1 F, followed by a normalization step to ensure the individual 
power constraints. To incorporate the selection of relays, we first randomly select Q out 
of M relays, and then perform the PCA and normalization steps. 

• Random-SDR: In this algorithm, we first randomly select Q out of M relays in the network, 
and then perform the two-step SDR algorithm similarly as the one proposed in Section |V] 
(with fixed user- grouping). 

• Greedy algorithm: This algorithm largely follows from the one proposed in [|22|. in which 
users are successively and greedily added to the cooperation set as long as by doing so can 
improve the received SNR level. As the original algorithm does not impose hard cardinality 
constraint, we slightly modify it by keep adding the users utile the required group size Q 
is reached. Note that this algorithm requires the knowledge of perfect channel states. 

In Fig. [5]-[6l the approximation ratio and the achieved receive SNR of different algorithms 
is plotted against the size of the network. In Fig. |7]-[8l similar performance metrics are shown 
when we increase the relays' transmission powers. In both of these two figures we set P = 
dBW and Q = 4^. We see that the proposed algorithm is close to the optimal solution across 
all values of individual power constraints and for all network sizes simulated, while the other 
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three algorithms perform relatively worse in all the experiments. Furthermore, the proposed 



algorithm has significantly better "worse case" performance than the random-SDR algorithm 
(see Fig. \5\ and Fig. [7]). 




Fig. 7. Approximation ratio for different algorithms. M = Fi g- 8 - Averaged receive SNR for different algorithms. M = 
30, Q = ( M), Ptot = [o, 5 , 10, 15, 20, 25, 30, 35, 40] dBW, 30, Q = (f ), Ptot = [0, 5, 10, 15, 20, 25, 30, 35, 40] dBW, 
100 independent realizations of channel. 100 independent realizations of channel. 



VII. Discussion and Concluding Remarks 

This paper addresses the important problem arises in the virtual multi-antenna systems, 
namely the joint user grouping and the virtual beamformer design problem. In the aim of 
designing practical algorithms with provable theoretical performance, we focus on simple (yet 
important) scenarios in which either multiple transmitters cooperatively transmit to a receiver, 
or a single transmitter transmits to the receiver with the help of a set of cooperative relays. 

For different application scenarios, the problems are formulated as cardinality constrained 
programs, and efficient algorithms based on semi-definite relaxation are proposed. The quality 
of the solutions is evaluated both theoretically and via numerical simulation. Furthermore, we 
have established the complexity status for various problems studied in this work, which we 
summarize below in Table |V] 

TABLE V 

Summary of Complexity Results 





Admission Control 


Scheduling (2-Slot) 




General 


Diagonal 


Rank 1 


General 


Diagonal 


Rank 1 


VMIMO 


NP-Hard 


Ploy. Time Solvable 


Ploy. Time Solvable 


NP-Hard 


NP-Hard 


NP-Hard 


VMIMO-Relay 


NP-Hard 


Ploy. Time Solvable 


Unknown 


NP-Hard 


NP-Hard 


Unknown 



The proposed approach to virtual beamforming can be extended in many interesting direc- 
tions. Theoretically, it is desirable to develop approximation bounds for the relay selection 
problem as well as to further sharpen the bounds for the scheduling problem so that it is 
independent of the problem data (that is, independent of the channel covariance matrices). 
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Practically, we expect that our approach can be applied and extended to many other interesting 
scenarios, even beyond the virtual multi-antenna systems. For example, in the user selection 
problem, to mitigate the interference to a neighboring co-existing system, one could further 
impose a total interference constraint of the form E[|w H g| 2 ] < / to the selected group of users, 
where g represents the channel between the users and the BS of the neighboring system (e.g., 
see ll32l for an application of this model in the cognitive ratio network). By using the similar 
algorithm as proposed in Section|Inl it is possible to show that the obtained solution achieves an 
approximation ratio of a 4 = ^7^M§j ln(5(Q + 1)) with G = Efgg*]. Another interesting 
direction will be to extend the proposed algorithm to uplink networks with multiple groups 
of users interfering with each other, or to relay networks with multiple pairs of transceivers 
(similarly as those discussed in |fl2lQ. Additionally, the proposed approach may prove to be 
useful in designing future multi-cell cellular systems, where not all, but a subset of BSs can 
jointly transmit and receive signals for a given mobile user (see recent work ll20l . [|2T|. Il35l 
on the so-called partial ComP technique). 
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